N-gram-based SMT System Enhanced with Reordering Patterns
نویسندگان
چکیده
This work presents translation results for the three data sets made available in the shared task “Exploiting Parallel Texts for Statistical Machine Translation” of the HLT-NAACL 2006 Workshop on Statistical Machine Translation. All results presented were generated by using the Ngram-based statistical machine translation system which has been enhanced from the last year’s evaluation with a tagged target language model (using Part-Of-Speech tags). For both Spanish-English translation directions and the English-to-French translation task, the baseline system allows for linguistically motivated sourceside reorderings.
منابع مشابه
N-gram-based machine translation enhanced with neural networks for the French-English BTEC-IWSLT'10 task
Neural Network Language Models (NNLMs) have been applied to Statistical Machine Translation (SMT) outperforming the translation quality. N -best list rescoring is the most popular approach to deal with the computational problems that appear when using huge NNLMs. But the question of “how much improvement could be achieved in a coupled system” remains unanswered. This open question motivated som...
متن کاملThe Operation Sequence Model - Combining N-Gram-Based and Phrase-Based Statistical Machine Translation
In this article, we present a novel machine translation model, the Operation Sequence Model (OSM), that combines the benefits of phrase-based and N-gram-based SMT and remedies their drawbacks. The model represents the translation process as a linear sequence of operations. The sequence includes not only translation operations but also reordering operations. As in Ngram-based SMT, the model is: ...
متن کاملAn Ngram-based reordering model
This paper describes in detail a novel approach to the reordering challenge in statistical machine translation (SMT). This Ngram-based reordering (NbR) approach uses the powerful techniques of SMT systems to generate a weighted reordering graph. Thus, statistical criteria reordering constraints are supplied to an SMT system, and this allows an extension to the SMT decoding search. The NbR appro...
متن کاملSyntax-enhanced N-gram-based SMT
This paper addresses the problem of word reordering in statistical machine translation. We follow a word order monotonization strategy making use of syntax information (dependency parse tree) of the source language to build a set of automatically extracted reordering rules. The input sentence is extended to a graph built with reordering hypotheses, hence, allowing for a constrained search on th...
متن کاملIntegration of POStag-based source reordering into SMT decoding by an extended search graph
This paper presents a reordering framework for statistical machine translation (SMT) where source-side reorderings are integrated into SMT decoding, allowing for a highly constrained reordered search graph. The monotone search is extended by means of a set of reordering patterns (linguistically motivated rewrite patterns). Patterns are automatically learnt in training from word-to-word alignmen...
متن کامل